An Algorithm for Estimating Mixture Distribution of High Dimensional Vectors And Its Application to Character Recognition

Authors

  • Fang Sun
  • Hirotomo Aso
Abstract

In statistical pattern recognition, high recognition accuracy depends on estimating the distribution of the feature vectors precisely. The feature vectors extracted from the objects to be recognized are often assumed to follow a normal distribution, but in practice their distribution is more complicated and variable. Modeling it as a mixture of normal distributions is considered more realistic. Estimating a mixture distribution precisely, however, requires a large number of training samples, especially when the feature vectors have many dimensions, and there are rarely enough training samples relative to the number of dimensions. For this reason, mixture-of-normals estimation is seldom used in recognition problems with high-dimensional vectors, such as character recognition. This paper proposes an algorithm for estimating a mixture of normal distributions for high-dimensional vectors by introducing the Simplified Mahalanobis distance into the maximum likelihood estimates. As a practical application, the algorithm is applied to character recognition: a multi-template dictionary is constructed that takes the distribution of each category into account. The effectiveness of the proposed method is examined in experiments on Japanese characters.
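
To make the idea of maximum likelihood estimation of a normal mixture concrete, here is a minimal sketch of the standard EM procedure. It is not the algorithm proposed in the paper: the abstract does not define the Simplified Mahalanobis distance, so this sketch swaps in a plain diagonal covariance with a variance floor, which is one common way to keep the estimate stable when samples are scarce relative to the dimensionality. All function names, the farthest-point initialisation, and the toy data are assumptions made for illustration.

```python
import math
import random


def log_normal_diag(x, mean, var):
    """Log density of a normal distribution with diagonal covariance."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def farthest_point_init(data, k):
    """Deterministic initial means: start at data[0], then repeatedly add
    the sample farthest from the means chosen so far."""
    means = [list(data[0])]
    while len(means) < k:
        def gap(x):
            return min(sum((a - b) ** 2 for a, b in zip(x, m)) for m in means)
        means.append(list(max(data, key=gap)))
    return means


def fit_diag_mixture(data, k, n_iter=30, var_floor=1e-3):
    """EM for a k-component normal mixture with diagonal covariances.

    Returns (weights, means, variances). The diagonal restriction and the
    variance floor are assumptions of this sketch, not the paper's method.
    """
    n, d = len(data), len(data[0])
    means = farthest_point_init(data, k)
    variances = [[1.0] * d for _ in range(k)]
    weights = [1.0 / k] * k

    for _ in range(n_iter):
        # E-step: posterior probability of each component for each sample,
        # computed with the log-sum-exp trick for numerical stability.
        resp = []
        for x in data:
            logs = [
                math.log(weights[j]) + log_normal_diag(x, means[j], variances[j])
                for j in range(k)
            ]
            top = max(logs)
            probs = [math.exp(v - top) for v in logs]
            total = sum(probs)
            resp.append([p / total for p in probs])

        # M-step: responsibility-weighted maximum likelihood updates.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12  # guard against empty components
            weights[j] = nj / n
            means[j] = [
                sum(r[j] * x[i] for r, x in zip(resp, data)) / nj
                for i in range(d)
            ]
            variances[j] = [
                max(
                    var_floor,
                    sum(r[j] * (x[i] - means[j][i]) ** 2
                        for r, x in zip(resp, data)) / nj,
                )
                for i in range(d)
            ]
    return weights, means, variances


def make_two_clusters(n_per_cluster=100, dim=4, seed=1):
    """Hypothetical toy data: two unit-variance clusters centred at 0 and 5."""
    rng = random.Random(seed)
    return [
        [rng.gauss(centre, 1.0) for _ in range(dim)]
        for centre in (0.0, 5.0)
        for _ in range(n_per_cluster)
    ]
```

Calling `fit_diag_mixture(make_two_clusters(), k=2)` returns one weight, mean vector, and variance vector per component. In a multi-template setting of the kind the abstract describes, each component mean could serve as one template for a category, though how the paper actually builds its dictionary is not stated here.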

Similar resources

Pattern Recognition with Gaussian Mixture Models of Marginal Distributions

Precise estimation of data distribution with a small number of sample patterns is an important and challenging problem in the field of statistical pattern recognition. In this paper, we propose a novel method for estimating multimodal data distribution based on the Gaussian mixture model. In the proposed method, multiple random vectors are generated after classifying the elements of the feature...

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers because of its applications in various fields of speech processing. Recent research shows that using deep neural networks (DNNs) in speech recognition systems significantly improves their performance. DNN-based phoneme recognition systems have two phases: training and testing. Mos...

Application of Pattern Recognition Algorithms for Clustering Power System to Voltage Control Areas and Comparison of Their Results

Finding the collapse-susceptible portion of a power system is one of the purposes of voltage stability analysis. This portion, which is itself a voltage control area, is called the voltage weak area. Determining the weak area and the adjacent voltage control areas is particularly important for improving voltage stability. Designing an on-line corrective control requires the voltage weak area to be determi...

Novel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection

In this study, two novel learning algorithms are applied to the Radial Basis Function Neural Network (RBFNN) to approximate highly non-linear functions. The Probabilistic Evolutionary (PE) and Gaussian Mixture Model (GMM) techniques are proposed to significantly reduce the error functions. The main idea concerns the various strategies to optimize the procedure of Gradient ...

Publication date: 1999